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Abstract 

Dana Scott used the partial order among partial functions for his mathe- 
matical model of recursively defined functions. He interpreted the partial order 
as one of information content. In this paper we elaborate on Scott's suggestion 
of regarding computation as a process of information maximization by apply- 
ing it to the solution of constraint satisfaction problems. Here the method of 
constraint propagation can be interpreted as decreasing uncertainty about the 
solution — that is, as gain in information about the solution. As illustrative 
example we choose numerical constraint satisfaction problems to be solved by 
interval constraints. 

To facilitate this approach to constraint solving we formulate constraint 
satisfaction problems as formulas in predicate logic. This necessitates extending 
the usual semantics for predicate logic so that meaning is assigned not only to 
sentences but also to formulas with free variables. 



1 Computation as maximization in information space 

The early history of constraint processing is written in three MIT theses: Suther- 
land's, Waltz's, and Steele's [121 114} llOj . Already in this small selection one can 
discern two radically different approaches. Sutherland and Steele use relaxation: 
starting form a guessed assignment of vlaues to variables, constraints are succes- 
sively used to adjust variables in such a way as to satisfy better the constraint 
under consideration. These authors followed an old idea brought into prominence 
under the name of relaxation by Southwell [llj. 

Waltz adopted a radically different approach (and was, to our knowledge, the 
first to do so). He associated with each of the problem's variables a domain; that is, 
the set of all values that are not a priori impossible. Each constraint is the used to 
eliminate values from the domains of one or more variables affected by the constraint 
that are incompatible with that constraint. In this paper we are concerned with the 
latter method, which we call the domain reduction method. 
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The attraction of domain reduction is its completeness for finite domains: if a 
solution exists, then it will be found. This in contrast with relaxation, which can 
flounder forevei0. 

In this paper we present domain reduction as an example of the view of compu- 
tation as monotonic gain of information. This view was pioneered by Dana Scott, 
who was the first to make mathematical sense [5] of a recursively defined function 
/. He did this by associating with the definition of / a sequence a of partial func- 
tions. If X is such that f{x) requires a recursion depth is at most n, then a„(a;) is 
defined and equal to /(x); otherwise o„(a;) is undefined. Thus a is a sequence of 
partial functions in which each function agrees with the previous one, but is "more 
defined" . 

In general, if two partial functions g and h of the same type are such that h 
is defined wherever g is and such that they have the same value when both are 
defined, then Scott proposed to regard g as an approximation to h and noted that 
this notion of approximation is a partial order in the set of partial functions of 
the same type. Moreover Scott proposed to transfer the information concept from 
random variables, as it was in Shannon's information theory, to partial functions, 
noting that a partial function can be regarded as containing more information than 
functions approximating it. The approach to the semantics of recursive definitions 
can be summarized by saying that every such definition can be regarded as the limit 
of a sequence of approximations each containing more information about the limit 
of the sequence than the previous one. 

Scott was aware that it might seem somewhat far-fetched to give such an inter- 
pretation to the notion of "information". As a justification Scott [9j gave another 
example of a set partially ordered by information: that of numerical intervals. Al- 
though this certainly strengthened the case, this suggestion has not, as far as we 
know, been followed up. In this paper we do so, motivated by the opportunities for 
deeper understanding of constraint solving. 

In numerical applications the view of computation as monotonic gain of informa- 
tion is more than a theoretically interesting insight: it is adds an essential capability. 
Suppose a conventional numerical computation is stopped after 1,000 iterations and 
yields 1.912837465 and that it yields 1.912877134 when allowed to run for 10,000 
iterations, what do we know about the improvement obtained, if any? If results, in- 
termediate and final, were expressed as intervals we would, say, have [1.912, 1.938J1 
after 1,000 iterations and perhaps [1.9216, 1.9283jl after 10,000 iterations. Here we 
see that we know more about the unknown solution as a result of the additional 
computational work. Rephrasing "knowing more" as "gain in information" suggests 
that the effect of iteration in interval arithmetic can be described as "monotonic 
gain of information" . The important qualification "monotonic" is there because in 

^ But, as one may expect, domain reduction is no cure-all. For some problems, relaxation 
quickly finds a solution, and domain reduction requires an infeasible amount of time. The n-queens 
problem for large n is an example. Van Hentenryck and Michel page 89, mention n = 10, 000 
as a routine example for relaxation in combination with their search technique. 

^ Note the smaller number of decimals: with intervals it becomes clear that additional decimals 
would be meaningless. 

^ The smaller interval warrants another decimal. 
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interval arithmetic we never need to settle for less information as a result of addi- 
tional computational work, though we may fail to make a gain. Moreover, such a 
stalling of progress is a useful criterion for halting the iteration. 

Because of the special importance of solving constraint satisfaction problems over 
the reals by means of floating-point arithmetic, we choose our example problem from 
this area. Section [2] gives the needed review of interval methods; section [3] describes 
the example. The new view of domain reduction as monotonic information gain 
is used in Section [5] to develop the method from first principles. This suggests 
regarding the set of constraints in a constraint satisfaction problem as a formula 
in predicate logic with a fixed interpretation of predicate symbols. The standard 
semantics only assigns meanings to closed formulas, whereas here we have a formula 
with free variables. Accordingly, in Section |4] we develop the required extension of 
the semantics of predicate logic. This needs a novel treatment of relations, also in 
this section. 

2 Interval arithmetic and interval constraints 

To facilitate the use of information in computation we do not use interval arithmetic 
directly, but indirectly via a constraint satisfaction problem (CSP). Such problems 
are solved by associating with each unknown a set of possible values instead of the 
usual single value. This is especially appropriate for real- valued unknowns, as these 
sets take the form of intervals with floating-point numbers as bounds. This special 
case of CSP solving is called interval constraints [Ull]. 

We introduce interval constraints by means of an example. In interval arithmetic 
the rule for adding intervals is 

[a, b] + [c,d\ = {x + y : X e [a,b] Ay £ [c, d\} 

so that, e.g., [0,2] + [0,2] = [0,4]. The analogous operation in interval constraints 
starts by defining the constraint sum{x, y, z) which holds between the reals x, y, and 
z iS X + y = z. In other words, the formula sum{x, y, z) is true whenever x + y = z. 
This leads to the following inference 

sum{x, y, z) 
a; G [0, 2] A y e [0, 2] A z G [-oo, +oo] 
X G [0, 2] A y G [0, 2] A z G [0, 4] 

We use here the conventional format for inference: the premisses above the horizon- 
tal line; the conclusion below. The above inference coincides, in this special case, 
with interval arithmetic. Only the interval for z is narrowed. 

In interval constraints we may have a priori constraints on all variables, as in 

sum{x, y, z) 
x G [0, 2] A y G [0, 2] A z G [3, 5] 
X G [1,2] Ay G [1,2] Az G [3,4] 

Here the intervals for all three variables are narrowed. As a result, the effect of 
the operation can no longer be exclusively characterized as an addition or as its 
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inverse: the effect is a mixture of several operations. We can formulate the effect 
algebraically as applying an operator, the contraction operator of the constraint 
sum, that maps triples of intervals to triples of intervals, in this case as 

([0,2], [0,2], [3, 5]) ^([1,2], [1,2], [3, 4]). (1) 

The righthand side of ([T]) is the smallest triple ("box") that can be inferred: any 
box that is strictly smaller would exclude points that are possible according the 
given premises of the inference. Thus this box is the optimal solution to the given 
constraint-satisfaction problem. The optimal solution is obtained by one addition 
and two subtractions of interval arithmetic plus a few bound comparisons. Similarly 
efficient algorithms exist for some other constraints, such as product, integer power, 
trigonometric and logarithmic functions. 

We may express the contraction operator for the sum constraint as a mapping 
from a tuple B of intervals to the least such tuple containing the intersection of B 
and the constraint. 

In general a CSP is a conjunction of many constraints. After applying the 
contraction operator for each of these once, it is often the case that another round 
of applications yields further contractions in the intervals for some of the variables. 
As the contractions are implemented in floating-point interval arithmetic and are 
assured valid by outward rounding, there is a limit and it is reached after a finite 
number of rounds of contractions. 

In each of the rounds it may happen that a constraint is found that does not 
contain variables for which a bound has changed. In such cases the contraction 
operator for that constraint has no effect and can be skipped. Algorithms have been 
developed that perform such optimizations [IJ. 



3 An example of solving by interval constraints 

Let us consider the problem of determining the intersection points of a parabola and 
a circle. For example, to solve the system 

with X G [0, 1] and y G [0, 1]. One can eliminate y and solve instead x'^ + x'^ = 1, 
which has two real roots. However, for the purpose of illustrating solving by interval 
constraints, we ignore this opportunity for simplification and we numerically solve 
the original system ([2]). 

The method of interval constraints applies to a class of constraints in the form 
of equalities or inequalities between real-valued expressions. The sum constraint 
in Section [2] is an example: it takes the form of the equation x + y = z. As 
we mentioned in that section, there is an efficient implementation of the optimal 
contraction operator for it. 

The second equation in ([2]) is not primitive; it has to be transformed to an 
equivalent set of primitive constraints. In this example the primitive constraints sq, 
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sum, and one are needed. The constraint sq{u, v) is defined as = v, sum{u, v, w) 
is defined as u + v = w, and one{u) is defined as u = 1. In this way ^ becomes the 
following set of constraints: 

{sq{x, y), sq{y, z), sum{y, z, u), one(n)}. (3) 

The unknowns x, y, z, and u are real numbers. The introduction of z and u is the 
result of reducing the given constraints to primitive ones. In more typical cases the 
given constraints are so complex that the introduced variables greatly outnumber 
the original ones. 

In the example it is given that x, y and z satisfy the above constraints. From 
the original problem statement we have in addition that x G [0,1] and y G [0,1]. 
Of the auxiliary unknowns z and u we initially know nothing: z £ [— oo, +oo] and 
u G [— oo, +oo]. 

In effect, we have transformed ([2|) to the system 

2 

y = X 

" = (4) 

y + z = u 
u = 1 

Instead of solving the original system ([2]) we solve equivalently the constraints 
([3]). This is done by repeatedly applying in arbitrary order the contraction operators 
until there is no change in any of the intervals associated with the unknowns. Ap- 
plying the contraction operators of sq{y, z) and one{u) results in a drastic narrowing 
of the intervals for z and u: they change from [— oo, +00] to [0, 1] for z and to [1, 1] 
for u. After this, none of the contraction operators of the four constraints results in 
a change. Therefore this is as far contraction operator application can take us. 

To obtain more information about possibly existing solutions, we split the CSP 
with interval X = [0, 1] for unknown x into two CSPs that are identical except for 
the intervals of x. In the first CSP the interval for x is the left half of X; in the 
second CSP it is the right half. Then we start another round of contraction operator 
applications starting from one of the halves as initial box: 

X G [0,^],y G [0,1], z G [0,1], n G [-00, +00]. (5) 

Applying the contraction operator for sq{x,y) results in y G [0,1/4]. Applying the 
contraction operator for sq{y,z) results in 2; G [0,1/16]. Applying the contraction 
operator for sum{y, z,u) results in u G [0,5/16]. Applying the contraction operator 
for one{u) causes the interval for u to become empty. This proves that there is no 
solution in the initial box ([5]). 
We now turn to the other half: 

X G , 1] , y G [0, 1] , z G [0, 1] , n G [-00, +00] . (6) 

Applying the contraction operator for sq{x,y) results in y G Continuing in 

tabular form gives 
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Interval 




X 


y 




u 




[0.5, 1] 


[0,1] 


[0,1] 


[-00, +oo] 


Apply 

sq{x,y) 
one{u) 
sum{y, z, u) 
sq{y, z) 






[0,|] 


[1,1] 



Now the intervals for x and y continue getting smaller until the least floating- 
point box has been reached that contains a solution: the intervals for x converge 
to a small interval containing \/(|(\/5 — 1)), while the intervals for y converge to a 
small interval containing ^(\/5 — 1). 

4 Notation and terminology for relations and constraints 

We take it that ([3]) is intuitively clear, but how do we characterize mathematically 
any solutions that such a CSP may have and how do we characterize mathematically 
an algorithm for obtaining such a solution? Consider for example the constraints 
sq{x,y) and sq{y,z). They clearly have something in common: sq, which must be 
some kind of relation. But the constraints are different from each other (otherwise 
their conjunction could be simplified by dropping either of them) and also different 
from sq, whatever that may be. 

In this section we develop a set-theoretic formulation of constraint-satisfaction 
problems and illustrate it by the example in Section [3l We find that such a formula- 
tion is facilitated by a treatment of relations and operations on them that is in the 
spirit of the conventional treatment, but differs in details. In particular, we need 
to clarify the difference between relations and constraints as well as the connection 
between these. 

4.1 Functions 

We denote by — )• T the set of total functions that are defined on S and have values 
in T. If / G (5 ^ T) we say that / "has type" S ^T. li S' C S, then we define 
fs', the restriction of / to S' as the function in S" — t- T such that for all x € S' we 
have fs'{x) = f{x). 

4.2 Tuples 

As is the case conventionally, our relations are sets of tuples of the same arity. 
However, we need the possibility to index tuples either by variables or by the con- 
ventional indexes {0, 1,2,...}. Hence we define a tuple as an element of the function 
set I — )• T, where / is an arbitrary set to serve as index set. / — >■ T is the type of 
the tuple. 

Example If t is tuple in {x, y} — t- TZ, then we may have tx = 1.1 and ty = 1.21. 
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Example t G 3 — )• {a,b,c}, where 3 = {0,1,2} and to = b, ti = c, and t2 = c. 
In cases like this, where the index set is an ordinal, we use the compact notation 
t = [b, c, c]. In general, we write n for {0, . . . , n — 1}. 

When a function is regarded as a tuple, then the restriction operation on func- 
tions is called projection. E.g. if t = [2, 1, 3] and t' = t{o,2}) then = 2 and t'2 = 3; 
t'l is not defined. 

4.3 Approximation structures 

In [9] Dana Scott proposed that computation steps be viewed as transitions in a 
partially ordered space of data. In his view computation consists of generating a 
time-ordered sequence dQ,di,d2, ■ ■ ■ with the property that the successive data di 
are each approximated by the previous in the sense of holding information about 
the limit of the sequence that is compatible and is at least as informative. We write 
do E c?i E ^2 E ■ ■ ■ where E is the partial order. 

Scott was primarily interested in using his approach to model mathematically 
the evaluation of recursively defined functions. This requires mathematically rather 
sophisticated constructions. However, the idea also applies to situations covered by 
the following definition. 

Definition 1 An approximation structure for a set D is a set A of subsets of D 
such that (1) A is closed under finite intersection, (2) A is closed under intersection 
of (possibly infinite) ^-descending chains of subsets, and (3) A contains D as an 
element. The information order E of A is defined as the inverse of the inclusion C 
of subsets. 

An approximation domain is a pair {D, A) formed by a set D and an approxima- 
tion structure A on D. {D,A) will be sometimes referred to as D when no ambiguity 
arises regarding A. 

Lemma 1 If D' C D, then there exists in any approximation structure for D a 
<^-least element containing D' . 

Definition 2 If A is an approximation structure for D, then for D' Q D we define 
a^iD') to be the least element of A containing D' . 

The set aAiD') corresponds to the maximum amount of information about D' that 
is expressible within approximation structure A. 

Example The intervals form an approximation structure in the set TZ of real 
numbers, where we define an interval as {x G TZ : a < x < b}, where a £ TZU {—00} 
and b gTZL) {+00}. We write [a, 6] for this interval. Note that with this definition, 
e.g., +00 [0, +00]. 

Example Let F be a subset of the set TZ of reals. The F-intervals are an ap- 
proximation structure in TZ, where an F-interval is {x G 7^ : a < 2; < 6} where 
a G F L) {—00} and b £ F L) {-|-oo}. An important example is the case where F 
is the set of finite double-length IEEE-standard floating-point numbers. The latter 
include —00 and +00, so that pairs of these numbers are a convenient representation 
for the elements of this approximation structure. 
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4.4 Relations 



A relation is a set of tuples with the same type. This type is the type of the relation. 

If r is a relation with type I ^ T, then the projection of r on /' C / is {/' G 
/' ^ T : 3/ G r.fp = f}. 

Example 

sum = {[x,y, z] £ {3 ^ TZ) : X + y = z} is a relation of type 3 — )• 7^. Compare this 
relation to the relation a = {s £ {{x, y, z} ^ TV) : Sx + Sy = Sz}- As their types are 
different, they are different relations; [2,2,4] G sum is not the same tuple as s £ a 
where Sx = 2, Sy = 2, and Sz = 4. 
Example 

If S has one element, then a relation of type 5" — )• T is a unary relation. Such a 
relation is often identified with a subset of T. For example, for a in T^U {— oo} and 
6 in 7^ U {+oo}, {/ G {{x} — s- TZ) : a < fx < b} is a unary relation that is often 
identified with the interval [a,b]. Maintaining the distinction between the two is 
important in the current setting (see Section 14. 6p . 

Definition 3Ifro and ri are relations with types Iq ^ T and Ii — )• T, respectively, 
then the join rg N ri o/tq and ri is 

{/ G [Iq Uh)^T: fj^ G ro and fi, G ri}. 

The join of relations that have disjoint index sets is called the product of these 
relations. 

We avoid the term "Cartesian product" because it is usually understood to 
consist of tuples with index set {0, . . . , n — 1} for some natural number n. 

Definition 4 Let r be a relation of type I ^ T and let I C J. Then we write 
the cylinder on r with respect to JasiTjr and define it as the greatest relation 
g CI [J ^ T) such that irjg = r. 

Cylindrification is inverse to projection in the sense that 7r/(7rJ^r) = r. 

Definition 5 Let I = {zq, . . . ,in-i} be an index set. A box is a product of unary 
relations tq C {ig} D, . . . , r^-i C {in-i} D. In case ro, . . . , r„_i are intervals, 
then one may refer to the box as an interval box. 

4.5 Boxes as approximation domain 

Lemma 2 Let I = {i^, . . . , in~i} be a finite index set and let B be the set of boxes 
of type I ^ D. Then {I ^ D,B) is an approximation domain. 

Proof We need to show the three defining properties (Definition [T]) . In this case one 
can show closure under arbitrary finite or infinite intersection, so that the first two 
properties can be established simultaneously. 

Let {r^ : j G J} be a possibly infinite family of boxes, r-' = Tq N • • • M r^^_^, with 

C {i^} — D for all k £ n. 



8 



Let 
Then 



r=p|H = P|(r^oN...Nr^;_i). 



/ e r = Pi 
jeJ 

<^ yj e J. f e 

<^ Vj eJ.yk€ n. fi^ e rl 
<^ VA; G n. Vj G J. fi^ G 
^ Vfc G n. /i, G P ri 



n-l 



jeJ jeJ 



Hence 




J 

n-l 



is also a box, so that the intersection of a possibly infinite family of boxes is a box. 
We finally need to show that the full relation r = I ^ D is a box. Letting 
= {ik} — >■ -D, we have that 



Therefore, for every relation r of type {io, . . . ,in-i} D there is a least box 
containing r, which justifies the following definition. 

Definition 6 The box operator applied to a relation r with type {zq, . . . , in-i} D 
is the least box Or that contains r. 

4.6 Constraints 

A constraint is a syntactic entity that is used to denote a relation. A constraint 
has the form of an atomic formula in a theory of predicate logic without function 
symbols. The semantics of predicate logic assigns a relation r to an atomic formula 
p{qo, . . . , Qn-i) with set V of variables. The relation r depends on the interpretation 
of p and on the tuple [qq, . . . , Qn-i] of arguments. These arguments are variables, not 
necessarily all different. The first-order predicate logic interpretation of the language 
of atomic formulas, which identifies the argument occurrences by numerical indexes, 
forces n = {0, . . . , n — 1} to be the index set of the relation M{p), the relation that is 



({io}^I))N...N({z„_i}^D) = 
{io,...,in-i} ^ D = 
I^D 



is a box. 



□ 
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the meaning of the predicate symbol p under the given interpretation. In our setting, 
instead, the index set associated with the constraint denoted by p{qo, . . . ,qn-i) is 
the set V of distinct variables occurring in atomic formula p{qo, ■ ■ ■ , Qn-i)- 

The interpretation M that assigns a relation of type {0, . . . , n — 1} to an n-ary 
predicate symbol p needs to be extended to an interpretation M that also assigns 
a relation of type V ^ D to a, constraint c = p{qo, ■ ■ ■ , qn-i) with V as set of free 
variables: 



Definition 7 

M(c) = {aeV^D: [a((?o), . . . , a(g„_i)] G M{p)}. 

As a result of this definition the meaning of a constraint c with set V of variables 
is a relation of type V ^ D. One can view the argument tuple of a constraint as 
an operator that converts a relation M{p) of type n — )• to relation M(c) of type 
V ^ D. 

This is an extension of the usual semantics of predicate logic. 



Example 

Given that M{sq) = {/ G ({0, 1} ^ 7^) : /i = f^] we have 

M{sq{x, y)) = {a G {{x, y] ^ U) : [a{x),a{y)] G M{sq)] 

= {a G {{x, y}^TZ): a{xf = a{y)} 

M{sq{y, x)) = {a G ({x, y} ^ TZ) : [a{y),a{x)] G M{sq)} 

= {a G ({x, y}^n): a{yf = a{x)} 

M{sq{x,x)) = {ae {{x} ^n) ■.[a{x),a{x)]£ M{sq)} 

= {ae {{x} ^n) : a{x) = Qy a{x) = 1} 

Definition 8 A tuple f £ V ^ D satisfies a constraint c if and only if the restric- 
tion of f to the set of variables occurring in c belongs to M{c). 

4.7 Constraint-satisfaction problems 

Definition 9 A constraint-satisfaction problem (CSP) has the form {C,V, D, M) 
and consists of a set C = {sq, . . . , Sm-i} of constraints, a set V, which is the set of 
the variables occurring in the constraints, a set D, the domain of the CSP, and an 
interpretation M , which maps every n-ary predicate symbol occurring in any of the 
constraints to a relation of type n ^ D. A solution to {C,V,D,M) is a G V ^ D 
such that G M{si) for all i G m, where Vi is the set of variables in s{i). 

It follows that the set a of solutions of the CSP is a relation of type V ^ D. 

Example In (C, V, D, M), let C = {sq{x, y), sq{y, z), sum(y, z, u), one{u)} (Equa- 
tion da])), V = {x,y,z,u}, D = n, M{sq) = . . ., M{sum) = and M{one) = .... 
The set a of solutions is a relation a C V ^ TZ such that n^^yja = {po,Pi} 

where poix) = -Vi^iV^ " 1))- Poiv) = UV^ " 1)' Pi(^) = ViUV^ + 1))' and 
Piiy) = UV5-l)- 
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This example shows a CSP with a finite and small solution set. Sudoku puzzles 
are another such example. It often happens that the solution set has an infinite 
number of elements, or a finite number that is too large to list or to process. 

Theorem 1 Let a be the solution set of a CSP C = {sq, ■ ■ ■ , Sm-i} with M as 
interpretation for its predicate symbols. Then we have 

a = M{so) M • • • N M{sm-i)- 

Proof By induction on the size of set {sq, ... , Sm-i}- The base case C = {sq} is 
trivial. 

Assume that the theorem holds for a constraint set Ck = {sq, . . . , Sk-i} of size 
A; > 1, and let cr(Cfc) = M(.so) M ■■• M M(sfc_i) denote the solution set of Cfc. 
Consider constraint set Ck+i = Ck U {.Sfc}. Any tuple t which is a solution of 
Ck+i = Cfc U {sfc} must be such that the restriction of t to the set of variables 
occurring in Ck is a solution of Ck, and the restriction of t to the set of variables 
occurring in Sk is a solution of Sk- Whence a{Ck+i) C a{Ck) ^ M{sk). Conversely, 
ii t £ (T{Ck) ^ M{sk), then by construction t satisfies Ck as well as Sk, whence t 
satisfies Ck+i = U {sk}. Therefore a{Ck+i) = cr(Cfc) N M{sk). □ 

5 Solving constraint-satisfaction problems 

What does it mean to "solve" a CSP? It is rare that the solution set a has but few 
elements, as it does in Sudoku. Though occupying only a small proportion of the 
type, a may have a finite and overwhelmingly large number of elements; it may also 
be an infinite set. Hence we can typically only hope to obtain some information 
about a. Useful information can come in the form of an approximation. 

If the approximation domain consists of computer-representable sets, as it typi- 
cally does, then Dcr is computer-representable, but will usually give too little infor- 
mation about a. But Dcr is useful in case one can show that it is empty: in that 
case a is empty; i.e. the CSP has no solutions. This is an advantage of treating 
numerical problems as CSPs: in conventional computation one can only conclude 
that no solutions were found. By formulating the problem as a CSP with intervals 
as approximation structure one may be able to show that no solutions exist. 

In case it is not possible to show that □cj is empty, one subdivides the box under 
consideration and one may be able to show that one of these subdivisions has no 
solutions. Let box P ( "probe" ) be such a subdivision. We use it to reduce the partial 
solution of the problem of determining a to that of determining any solutions that 
might occur in P, or to find, also usefully, that no solutions occur in P. Thus we 
proceed to obtain information about a P. This intersection is in general not a 
box, so is not necessarily computer-representable. Hence it is an appropriate task 
for an algorithm to determine □ (cr n P) for a given CSP and a suitable P, or an 
approximation to □(an P) (which is itself an approximation) . 

Subdivision of P should result in subsets of P whose union includes P. These 
subsets are subject to the same consideration: if absence of solutions cannot be 
shown and if amenable to subdivision, the process repeats for such a subset. Any 
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box P defines a tree of subsets to be processed in tliis way: solving a CSP requires, 
in addition to an attempt to show tlie absence of solutions in a given box, a search 
over the tree of subboxes of the initially given box. The "solution" of a numerical 
CSP is necessarily a list of boxes each of which is too small to subdivide and of 
which the absence of solutions cannot be shown. Of a solution x G TZ^ the best one 
can typically do is to fail to show that □({x}) contains no solutions of the CSP. 

5.1 Contraction operators 

A contraction operator transforms a box B into a box B' C B such that there is no 
solution in B \ B'. Two kinds of contraction operators on boxes are defined here: 
operators defined by relations, and operators defined by constraints. 

5.1.1 Contraction operators defined by a relation 

Definition 10 Let D be an approximation domain and I an index set. Any relation 
r of type I ^ D determines the mapping ")r{P) = ^{rf^P), the contraction operator 
of r, that maps boxes with type I ^ D to boxes with the same type. 

Benhamou and Older [3J introduced this formula for intervals of reals. Here it 
is generalized to approximation systems in general. 

Lemma 3 The contraction operator 7,. is idempotent, monotonic, inflationary and 
correct. 

Proof We have that □(□(r n P) n P) = □(r n P) n P = □(r n P); hence 7^ is 
idempotent. 

□ is monotonic and intersection is monotonic in both arguments, so is mono- 
tonic. 

7^(P) = □(r n P) C DP = P, so that P C 7r(P). That is, 7^ moves up in the 
(information) partial order: 7^ is inflationary. 

We have that r n (P \ 7r(P))) = meaning that is correct in the sense that 
it does not remove any part of r from its argument. 

An example of a contraction operator The contraction operator for the sum 
constraint acting on a box 

({x} ^ [a, 6]) N {{y] ^ [c,d]) M {{z] ^ [e,/]) 

where a, b, c, d, e, / are finite IEEE-standard floating-point numbers: 

(({^} ^ [a,b]) N {{y} ^ [c,d]) N {{z} ^ [e,/])) = 

({x} ^ [a',b']) N ({y} ^ [c',d']) N {{z} ^ [e',/']) 

where 

[a',b'] = [a,6]n[(e-d)-,(/-c)+] 
[c',d'] = [c,d]n[{e-b)-,{f-a)+] 
[e'J'] = [e,/]n[(a + c)-,(6 + d)+] 
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where superscript ~ means that the floating-point operation is performed in round- 
toward-minus-infinity mode and superscript means that the floating-point oper- 
ation is performed in round-toward-plus-infinity mode. In this way correctness of 
Isum is maintained in the presence of rounding errors. 

In Equation ([1]) the contraction operator is applied in the case where a = 0, 
6 = 2, c = 0, d = 2, e = 3, and / = 5. Applying jM{sum{x,y,z)) this special case 
gives 

[a',b'] = [0,2]n[l,5] = [1,2] 
[c',d'] = [0,2]n[l,5] = [1,2] 
[e'J'] = [3, 5] n [0,4] = [3,4] 

This only gives the general idea. An actual algorithm has to take care of the 
possibility of overflow. It also has to allow for the possibility that a, c or e are 
— oo and that b,d ov f may be -|-oo so that the undefined cases (+oo) — (+oo), 
(— oo) -|- (+oo), and (+oo) -|- (— oo) have to be circumvented. For details about such 
algorithms see |6j. 

5.1.2 Contraction operators defined by a CSP 

In the CSP defined by the constraints {soj ■ • • j^m-i}) let us write ai for M{si). 
Then Theorem [1] says that 

(T = (To N • • • N a„i-i- 

The 7 of Definition [To] is not useful for r = a, but it can be useful for the r = ai, the 
solution sets for the constraints by themselves. In fact, the constraints are chosen 
to be such that one has an efficient algorithm for each 70-^. 

Definition 11 Given a CSP {{sq, . . . ,Sm-i},V,D,M) with Vi as set of variables 
of Si. We define for this CSP 

j,{P) = TTyH^^X^V.P)) = 7r-\n{ain {ttv^P))), i = 0,...,m-l 

and 

r(P) = 7o(P)n---n7^_i(P), 

for any box P of type V ^ D. For i = 0, . . . , m — 1, 7^ is the contraction operator 
of Si, and T is the contraction operator of the CSP. 

Lemma A V is inflationary, monotonic, and correct. 

Proof Since, by Lemma El each 70-^ is inflationary, one has 

m—l 

□ ^y\^i^V.P)) 
= P 
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Hence T is inflationary. 

r is monotone, as a composition of monotone operators, since both projection 
TTVi and cylindrification vTy^ are monotone operators. 

Finally T is correct. Indeed, since by Lemma El each crj is correct, i.e. satisfies 
ai n (vry-P \ 7(7i(vryi-P)) = 0, one has, for any tuple /, that f £ {P \ T{P)) ^ f € 
P and 3i f ^ ^i{P) i.e., fv, (Ji i.e., f ^ a. Hence / G (P \ T{P)) implies /0 a, 
thus cr n (P \ r(P)) = 0. Therefore F is correct. □ 

A counter example to the idempotency of F is given by the CSP example discussed 
earlier, in Section [3l 

{sq{x, y), sq{y, z), sum{y, z, u), one(n)}. 

It is enough to take e.g. the approximation domain of (real) boxes included in 
{x, u, z, u} — )• TZ, the corresponding F operator operating on that domain, together 
with the box P informally described in equation ([HD, namely P = {f : {x, y, z, u} — )■ 
n:f{x) G [i,l],/(y) G [0,1], /(z) G [0,1], /(n) G [1,1].}. The sequence (F"(P))„eN 
is strictly decreasing until it stabilizes at the smallest box, in the approximation 
domain, containing the tuple / : {x,y,z,u} — )• TZ, such that f{x) = ■v/(^(\/5 — 
1)), f{y) = UV^ - 1), f{z) = i(V5 - 1)2, and /(n) = 1. 

5.2 Algorithms 

Algorithms for solving CSPs proceed by applying contraction operators. Hence the 
algorithms only remove tuples from consideration that are not part of the solution. 
In the course of this process absence of solutions of the CSP may be demonstrated, 
but in general solutions are not constructed. 

In the case of a discrete D it may happen that applying constraint contractors 
may result in a box that contains a single tuple. This tuple will then need to be 
substituted in the CSP to check whether it is a solution. However, in the type of 
CSP we are concerned with here (reals with floating-point intervals as approximation 
domain), finding a solution this way is but a remote theoretical possibility (the prob- 
lem would have to have an exact solution in terms of floating-point numbers, which, 
moreover, upon substitution would miraculously avoid rounding errors). Hence for 
numerical CSPs the best we can expect is an algorithm that results in a small box. 
This box can be small indeed: in double-length IEEE-standard floating-point arith- 
metic the box can have as projections intervals of relative width around 10^^^. The 
result shows that, if a solution exists, it has to be in that box. 

Among the algorithms that use contraction operators to solve CSPs we dis- 
tinguish two types of iteration according to the order in which the operators are 
applied. We distinguish rigid order from and flexible order. The latter type leaves 
more choice in the choice of the next operator to be applied. 

Consider a CSP (C, V, D, M) with contraction operators 70, ... , 7m-i ■ The rigid- 
order algorithm applies the m operators in such an order that between two successive 
applications of any particular operator all other operators are applied. The rigid- 
order algorithm is susceptible to improvement. In a typical CSP m can be in the 
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order of hundreds or thousands, whereas each of the constraints typically has few 
arguments. In numerical CSPs, for example, there are three or fewer. Usually each 
constraint shares an argument with several others. In such a situation most of the 
contractor applications have no effect: each application affects only few of many 
arguments and it may well be that the next operator belongs to a constraint that 
does not involve any of these few arguments, so that its application has no effect. 

This suggests a chaotic algorithm, one that avoids such ineffectual choices of 
operator application^. There is considerable scope for such optimization, as the 
only constraint on the sequence of operator applications is that this sequence be 
fair in the following sense. 

Definition 12 Let S be an infinite sequence of which the elements are elements of 
a finite set A. S is fair iff each element of A occurs infinitely many times in S. 

Thus, in a fair sequence, it is possible, but not necessary, that between two occur- 
rences of the same item all other items have occurred. 

A chaotic algorithm with m operators applies the operators in a fair sequence. 
Such an algorithm can generate a fair sequence while maintaining a record of the last 
index in the sequence where a change was effected. As soon as all the operators have 
been applied without any resulting change, then, by idempotence, the algorithm 
can be halted: the rest of the infinitely long fair sequence consists of operator 
applications that have no effect. For details, see [Ij. 

5.3 Optimality of the chaotic algorithm 

The chaotic algorithm is optimal in the sense that it computes a solution of the 
following problem 

maximize B 
subject to BQ r{B) 

where B ranges over the boxes in the approximation domain, and T is the F op- 
erator associated with the CSP. The problem is stated in a format borrowed from 
"mathematical programming" in the sense that this includes, for example, linear 
programming. In the above format the total order among real numbers has been 
replaced by the partial order which is the Scott information order described in Sec- 
tion m The generalization from the total order of mathematical programming to 
programming with partial orders is due to Parker who captures a wide variety of 
algorithms in this framework [7j. 

It is easily seen that the chaotic iteration gives the optimal result if the sequence 
generated by the algorithm converges to the least fixpoint of F. Note that C is the 
information order, where Bq C Bi iff each of the projections of Bi is a subset of the 
corresponding projection of Bq. 

^ The term "chaotic" has been adopted by the constraint processing literature via a detour from 
a numerical algorithm [4]. 
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Fixpoints We review some basic facts about fixpoints. Let {D, C, _L) be a com- 
plete partially ordered set. Completeness means here that every infinite ascending 
chain cq !^ ci C . . . has a least upper bound |J^o ^^^^ is an element of the 
partially ordered set. 

Let r G (D — > -D) be monotonic and continuous. Continuity of a function 
f e D ^ D means that for every infinite ascending chain cq E ci E • • • we have 
/(IJ^QCj) = [_\iZof{^i)- case of a finite D such as the partially ordered set 
of fioating-point intervals, monotonicity implies continuity. By the Knaster-Tarski 
theorem, T has a least fixpoint lfp{r) G D. This may be seen as follows. 

By monotonicity of F, 

-L c r(±) c r2(±) c ... 

By the completeness of the partially ordered set, |J^o ^"(-L) G D. By the continuity 
of r, 

OO OO OO 

ri\J r-(±)) = □ r(r-(±)) = y r"(±). 

n=0 n=0 n=0 

Hence jj^^o ^"(-L) is a fixpoint of F. 

We now turn to the Tarski fixpoint theorem. Let F G (.D — > .D) be monotonic, 
but assume now that partially ordered set {D,Q,1.) is a complete lattice, a richer 
structure. Completeness means here that any subset of D has a least upper bound 
and a greatest lower bound. In particular D possesses a largest element T. Then 
by the Tarski fixpoint theorem F has a least fixpoint lfp{r) G D. This may be seen 
as follows. 

Consider the set S = {a & D : F(a) 1^ a}. S is non-empty since it contains top 
element T £ D. Let I = nS be the greatest lower bound of S. Then for any element 
a e S, one has 

aeS^lHa^ T{1) C F(a) C a 

by monotonicity of F. Hence F(/) is lower bound for S, T{1) C I = FlS. Therefore 
I G S. One then has the chain of implications 

F(/) c / r(r(0) E F(/) ^ F(/) G 5 ^ / E r(/) ^ / = r(/). 

Hence I is a fixpoint of F. It is also the least fixpoint, since 5" contains every fixpoint, 
and / = nS. Therefore l = r\S = lfp(T) is the least fixpoint of F. 

Application of fixpoint theory to the chaotic algorithm 

Theorem 2 Given a CSP {C, V, D, M) with contraction operator F and solution 
set a. For any box P of type V D we have 

(anP) C □((jnP) C F"(P) 

for all n = 0,1,2, .. . 
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Proof The first inclusion follows from the definition of the □ operator. We consider 
the case where there are m = 2 constraints, which easily extends to arbitrary greater 
values of m. We write cxj = M(sj) and Vi for the set of variables in Sj, for i = 0, 1. 
We first consider the case n = 1. 

□ (crnP) = 

□ {a e{V ^ D) : G fXo A S cti A a G P} = 
□ {a ^ {V ^ D) : avb G c^o A ay^ G di A ay^ G -kvoP A avi G vry-^P} = 
□ {a G (y -D) : 014, G (ctq n TTy^P) A G (fii n vry^P)} = 
□ (^^i(aon7ry„P)n^^i(ain7ry,P)) c 
□ □ (fjo n 7ry„ P) n □ {ai n vry, P) ) = 

vry^n(fTon7ryoP)n^^^n(ain7ry,P) = 

7o(P)n7i(P) = 

r(p). 

We have shown that □(a n P) C r(P). We also have □(a n P) C r2(P). This is 
because of the correctness of F: it does not remove any solution tuples from its 
argument. Hence we have □(fi H P) C r"'(P) for any n > 0. 

□ 

By Definition [11] F is the intersection of contraction operators, one for each 
constraint, each of which can be efficiently computed. The results of these operators 
are exact in the sense that the results are by definition approximations and are 
therefore exactly represent able. Thus Theorem [2] can serve as the basis for an 
algorithm for approximating the set of solutions in P. 

In terms of the information order C Theorem [2] states that r"'(P) C □(crnP) C 

(anP). 

Theorem 3 F zs monotonic on the partially ordered set of subboxes of P ordered 
by information order. 

Proof Each contraction operator ji : P t-^ TTy^(jcri{P)) is monotone, and the join of 
two monotone operators is monotone. 

□ 

Observe that the set of boxes contained in P defines an approximation structure 
for P. As F is monotonic and the partially ordered set of subboxes of P ordered by 
information order is a complete lattice with least element P, F has a least fixpoint 
lfp{r), by Tarski fixpoint theorem. We have F"(P) C lfp{r) □ □(cr n P) C n P) 
for all n. 

If the box operator □ is continuous over the approximation domain defined over 
D, then F is also continuous by compositionality of continuous functions, and by 
the Knaster- Tarski theorem |Ji^o-'^*(-^) least fixpoint of F contained in P. 

In particular, if D is the set F of finite double- length IEEE-standard floating- 
point numbers, and the approximation domain is given by the set of P-intervals, 
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then domain D is finite, hence both operators □ and F are continuous. The subboxes 
of P form a complete partially ordered set trivially because the finiteness of the set 
of floating-point numbers. Therefore [J^Q^^iP) = Lir=o (-^) ' ^'-'^ some finite n, is 
the least fixpoint of T. 

One has the following (compare with [Ij, Theorem 7): 

Theorem 4 The limit of every fair iteration of contraction operators {'ji : i = 
0, ...,m — 1} starting with some box P, exists and coincides with {^^^q^^{P) = 

N,-eN Ti{P)- 

Proof Let {kn)n&¥i be some fair iteration, where for each n, A;„ G {0, . . . ,m — 1} 
is the index of the constraint s G {sq, . . . , Sm-i} selected at the nth iteration step. 
The corresponding iteration starting from some box P is given by sequence of boxes 

Po = P 

Pn = Jk„iPn-l), n>0 

First, observe that 

Vj 3q T^P) □ Pg (8) 

Indeed, since (A;„)„g]N is a fair sequence, and since all operators 7j are inflationary 
and monotone, in order to have inequality F-'(P) C Pq for a given j, it is enough to 
choose q such that initial iteration subsequence {kn)n<q contains, for each constraint 
si in C, at least j occurrences of index Z of in {0, . . . , m — 1}; these occurrences 
correspond to at least j applications of the contraction operator 7/. Second, observe 
that 

yq P, E T\P) (9) 

follows by induction on q. 

Whence U°^oFJ(P) □ l^f^^Pj by (HD, and V^f^^Pj E U°^oF^'(-P) by The two 
limits are equal. 

□ 

One deduces the following. 

Theorem 5 Let a CSP ({sq, • • • , Sm— ill -D, ikf), with contraction operator V, and 
contraction operators 7, for each individual constraint Si be given. If the approxi- 
mation structure over D is such that the box operator □ is continuous, then, for 
every box P, every fair iteration of continuous operators 7^ starting with P con- 
verges towards the least common fixpoint U^qF-' (P) of the operators 7^ contained 
in P. 

6 Conclusion 

A good exposition of how interval constraints fits in the existing framework of con- 
straint processing can be found in section 6.6 of Apt's textbook [2j. For background 
on constraint propagation algorithms, see Apt pj. 
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In this paper we have shown that constraint processing by domain reduction 
can be viewed as the use of the computer for monotonic gain of information. It 
remains to show that this is more than a theoretical point of view. What is lacking 
in the current practice of computing is a quantitative treatment of the work done by 
the cpu per, say, gigacycle. The domain reduction method can be used to compare 
how many gigacycles were required to obtain the most recent domain reduction, 
expressed, say, as ratio of the cardinalities, or volumes, of the box before and after 
this reduction. One may conclude that a reduction of x percent is not worth the y 
gigacycles it cost, that further diminishing returns for computational effort are to 
be expected, and that therefore it is time to terminate the iteration. 
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